DAGGER: A New Approach to Combining Multiple Models Learned from Disjoint Subsets

Authors

  • Winton Davies
  • Pete Edwards
Abstract

We introduce a new technique for combining multiple learned models that results in a single comprehensible model, in contrast with current methods, which typically combine models by voting. The core of the technique, the DAGGER (Disjoint Aggregation using Example Reduction) algorithm, selects examples which provide evidence for each decision region within each local model. A single model is then learned from the union of these selected examples. We describe experiments on models learned from disjoint training sets that show: (i) DAGGER performs as well as weighted voting on this task; (ii) DAGGER extracts examples which are more informative than examples selected at random. The experiments were conducted on models learned from disjoint subsets generated with a uniform random distribution. DAGGER is, however, designed for use on naturally distributed tasks, where the data are not randomly distributed. We discuss how one view of the experimental results suggests that DAGGER should work well on this type of problem.
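
The pipeline described in the abstract (learn one local model per disjoint subset, select examples that support each decision region, then learn a single model from the union of the selected examples) can be illustrated with a minimal Python sketch. The abstract does not state how examples are selected, so the rule used here (keep the examples at the minimum and maximum of each feature within each leaf) is an assumption made only for illustration. The toy tree learner and the names `build_tree`, `select_examples`, `combine`, and `predict` are also hypothetical and are not taken from the paper.

```python
import random
from collections import Counter

def majority(rows):
    """Most common label among (features, label) rows."""
    return Counter(y for _, y in rows).most_common(1)[0][0]

def errors(rows):
    """Rows misclassified if the region predicts its majority label."""
    return len(rows) - Counter(y for _, y in rows).most_common(1)[0][1]

def build_tree(rows, depth=0, max_depth=3):
    """Toy greedy axis-parallel tree; each leaf is one decision region."""
    leaf = {"label": majority(rows), "rows": rows}
    if depth == max_depth or errors(rows) == 0:
        return leaf
    best = None
    for f in range(len(rows[0][0])):
        # Skipping the smallest value keeps both sides of the split non-empty.
        for t in sorted({x[f] for x, _ in rows})[1:]:
            left = [r for r in rows if r[0][f] < t]
            right = [r for r in rows if r[0][f] >= t]
            err = errors(left) + errors(right)
            if best is None or err < best[0]:
                best = (err, f, t, left, right)
    if best is None:  # every feature is constant, so no split is possible
        return leaf
    _, f, t, left, right = best
    return {"feature": f, "threshold": t,
            "left": build_tree(left, depth + 1, max_depth),
            "right": build_tree(right, depth + 1, max_depth)}

def leaves(tree):
    """Yield the leaf nodes (decision regions) of a tree."""
    if "label" in tree:
        yield tree
    else:
        yield from leaves(tree["left"])
        yield from leaves(tree["right"])

def select_examples(tree):
    """ASSUMED rule: per leaf, keep the examples at each feature's min and max."""
    picked = []
    for leaf in leaves(tree):
        rows = leaf["rows"]
        for f in range(len(rows[0][0])):
            picked.append(min(rows, key=lambda r: r[0][f]))
            picked.append(max(rows, key=lambda r: r[0][f]))
    return picked

def combine(subsets, max_depth=3):
    """Learn one local tree per subset, then one tree from the selected union."""
    selected = []
    for subset in subsets:
        local_model = build_tree(subset, max_depth=max_depth)
        selected.extend(select_examples(local_model))
    selected = list(dict.fromkeys(selected))  # drop duplicates, keep order
    return build_tree(selected, max_depth=max_depth), selected

def predict(tree, x):
    """Follow the splits down to a leaf and return its label."""
    while "label" not in tree:
        side = "left" if x[tree["feature"]] < tree["threshold"] else "right"
        tree = tree[side]
    return tree["label"]

# Synthetic demo data: the label depends only on the first feature.
rng = random.Random(0)
points = [(rng.random(), rng.random()) for _ in range(200)]
data = [(p, int(p[0] > 0.5)) for p in points]
subsets = [data[i::4] for i in range(4)]  # four disjoint subsets
model, selected = combine(subsets)
print(len(selected), "of", len(data), "examples kept")
```

The result is one tree rather than a voting ensemble, which is the property the abstract emphasizes. Whether this particular selection rule resembles the one used by DAGGER cannot be determined from the abstract alone.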

Similar resources

DAGGER: Using Instance Selection to Combine Multiple Models Learned from Disjoint Subsets

We introduce a novel instance selection method for combining multiple learned models. This technique results in a single comprehensible model. This is to be contrasted with current methods that typically combine models by voting. The core of the technique, the DAGGER (Disjoint Aggregation using Example Reduction) algorithm, selects examples which provide evidence for each decision region within ...

Distributed Learning on Very Large Data Sets

One approach to learning from intractably large data sets is to utilize all the training data by learning models on tractably sized subsets of the data. The subsets of data may be disjoint or partially overlapping. The individual learned models may be combined into a single model or a voting approach may be used to combine the classifications of a set of models. An approach to learning models in...

A Principal Components Approach to Combining Regression

The goal of combining the predictions of multiple learned models is to form an improved estimator. A combining strategy must be able to robustly handle the inherent correlation, or multicollinearity, of the learned models while identifying the unique contributions of each. A progression of existing approaches and their limitations with respect to these two issues are discussed. A new approach, ...

Model Combination in the Multiple-Data-Batches Scenario

The approach of combining models learned from multiple batches of data provide an alternative to the common practice of learning one model from all the available data (i.e., the data combination approach). This paper empirically examines the base-line behaviour of the model combination approach in this multiple-data-batches scenario. We find that model combination can lead to better performance e...

Combining Decision Trees Learned in Parallel

Very large data sets may be utilized for visualization. To focus attention on the salient regions of a data set being visualized, it is useful to have information on the interesting regions of data. It is possible to learn the salience of regions of data, but very slow, if possible, to do so serially on currently available terabyte-plus datasets. This paper describes an approach in which decision tree...

Publication date: 2000